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Question 1 [7 marks] 


Let Y; follow a Binomial(n,,p,) distribution and Y> follow a Binomial(n», p2) distribu- 
tion. Let p) = Y)/n, and po = Yo/ng. Recall that the expected value and variance of a 
Binomial(n,p) are np and np(1 — p). 


(a) Show that E(p; — fo) =m — po. /1 mark] 
(b) Show that V(p1 — po) = pi (1 — px) /ny + po(l — po)/ne [2 marks] 


T'wo brands of lawn mowers, denoted A and B, are each under warranty for 2 years. In 
a random sample of 60 lawnmowers of brand A, 14 failed before the warranty period was 
over. For brand B, 28 of 61 lawnmowers failed before the warranty period was over. 


(c) Estimate the difference in proportion (p4— pg) of lawnmowers that failed before the 
warranty period was over between brand A (p,4) and brand B (pz). /1 mark] 


(d) Use the normal approximation to the binomial to calculate an approximate 95% 


confidence interval for p4 — pp. [2 marks] 
(e) Carefully interpret the confidence interval from (d). /1 mark] 
Question 2 [4 marks] 
Suppose that Y), Yo,..., ¥, denote a random sample from a Poisson(A) distribution. (Re- 
call, p(y) = *&.) 
(a) Write the likelihood L(A|Y1, Yo,..., Yn) /1 mark] 
(b) Show that 5° Y; is sufficient for 2. /1 mark] 
(c) Show that the MLE for \ is A= STY;/n. /1 mark] 
(d) Show that \ is unbiased and consistent. /1 mark 


a) 
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Question 3 [3 marks] 


An additive is given to the diet of mn = 50 cows. The levels of the additive are 0, 0.1%, 
0.2%, or 0.3%. It is hoped that the additive will increase the level of milk fat. It is also 
thought that age and the initial weight of the cow may play a role in the production of 
milk fat. 

The variables are Y (% increase in milk fat), x, (additive %), x2 (age in months), and 
x3 (initial weight in kg). Data for the first few cows follows: 


y Ly £9 L3 
3.88 Q 49 618 
3.40 0 47 681 
344 Q 36 975 


A researcher considers three models (M1, M2, M3), where: 
0 oe ae Do + Bix He 


V2: ¥ = Bo sa D124 a Boxe ae 


M3. Y = Bo Si ByXy a Bo + Bax a ae 


(a) How many residual degrees of freedom are there for M3? /1 mark] 


(b) M3 can be re-written in matrix format as Y = X. Write the first three rows of the 
design matrix X. (Hint: don’t forget the intercept.) /1 mark/ 


(c) For M1, a 95% prediction interval was estimated for zj] = 0.1 as (2.59, 4.36). Inter- 
pret. [1 mark/ 
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Question 4 [5 marks] 


Refer to Question 4, Model M2. In this model, Y = 8) + 612, + Boxg +. 

R output for a linear regression of milk fat % (Y) versus % additive (x1) and age in 
months (x2) and are shown below (you may assume all regression assumptions have been 
met): 


Call: 
Im(formula = Y ~ xi + x2, data = milk.study) 


Coefficients: 
Estimate Std. Error t value Pr(>/t]) 
(Intercept) 3.028999 0.177281 17.086 < 2e-16 
2.088736 0.549342 3.802 0.000413 
0.005571 0.003283 1.697 0.096347 


Residual standard error: 0.4259 on 47 degrees of freedom 
Multiple R-squared: 0.2587, Adjusted R-squared: 0.2271 
F-statistic: 8.199 on 2 and 47 DF, p-value: 0.0008819 


a) Write down the regression equation relating milk fat % to % additive and age. 
g q g g 


[1 mark] 
(b) Is there evidence that the additive increases the level of milk fat? [1 mark/ 
(c) Interpret the R? statistic. [1 mark] 


(d) A 45 month-old cow that received 0.2% additive in her feed had a milk fat level of 
4.29%. Calculate the residual for this cow. [1 mark] 


(e) In order to achieve 5% milk fat, the regression equation predicts that a 45 month-old 
cow would need 0.8% additive. Explain why this prediction might be misleading. 
[1 mark] 
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Question 5 [4 marks] 


In capture-mark-recapture (CMR) studies, researchers capture and mark animals over a 
number of occasions in order to study demographic features of the population. Animals 
that are captured may be captured multiple times or just once. Capture-recapture studies 
generally assume that capture rates are the same for all animals, but this is not always the 
case. Some animals can become trap-dependent, either being ‘trap-happy’ or ‘trap-shy’. 
For example, if animals are lured with food, previously captured animals may learn to be 
recaptured, while if the capture procedure is unpleasant, previously captured animals may 
learn to avoid capture. 

A Chi-Squared test for trap-dependence assumes that whether an animal is missed 
or caught on occasion t is independent from whether the animal is missed or caught on 
occasion t+1. If the null hypothesis of independence is rejected, the interpretation is that 
there is evidence of trap-dependence. 

Consider the following table of observed captures for n = 155 animals, comparing cap- 
tures on the 3rd and 4th capture occasions. 


Missed © t+1=4 Caught @ t+1=4 


Missed @ t=3 60 30 
Caught © t=3 40 20 


(a) How many degrees of freedom are there for this Chi-Square Test? [1 mark/ 


(b) Calculate &;; for the number of animals that would be missed on both occasions. 
(Recall, Loe = RC; /n) /1 mark] 


(c) Calculate Cueto for the number of animals that would be missed on both occa- 
sions. [1 mark] 


(d) The p-value for this test is 0.63. Interpret. {1 mark 
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Question 6 7 | e _ | 2 4 | | [7 marks] . 


In a randomised block design, for = 1,2 Seng ON 9 1a 2 iste gD: 


be. = p++ Bj + &; 
where Yj; is the observation on treatment 7 in block 7, py is the overall mean, 7; is the 


nonrandom effect of treatment 7, where ae 7, = 0, B; is the nonrandom effect of block 
4, we ea By = 0, and the independent random error terms €;; ~ N(0, 07). 


Let Yi. => ay j=1 Vig denote the average of all responses in treatment 7. 


(a) Show that Y;, = w+auts ee Eig | | — [1 mark] | 


(b) For treatments i and i where 4 # i!, show that E (Y;, —Y¥y.)=%—Ty ~~ [1 mark] 
(c) Show that VY, —Yy.)=207/b . [1 mark] 


A randomised block experiment was conducted to examine the treatment effect of three 
methods of soil preparation (A: no preparation, B: light fertilisation, C: burning) on pine 
seedling growth (Y, measured in cm). There were four locations (LA, D2, 13, L4) treated 
as blocks due to possible differences in soil quality. Each location was then divided into 
three plots, with each method of soil prepartation randomly assigned to a plot in each 
location. 

7 that a ian _ interval for the difference of treatment means is given by 


oo Yo. + tajes/2/ 


(d) What are the total sample size and residual df for this experiment? - ; [1 mark] 


(e) Calculate and interpret the 95% confidence interval for the difference between light 
fertilisation (method B) and burning (method C) if Yp. = 14.5, Yo. = 10.5, s = 1.6, 
and ty = 2.45. [2 marks] 


(f) How many locations would be needed to estimate the difference in mean growth be- 
tween any two soil preparation methods to within 1 cm? (use tg/p % 2 and s = 1.6; 
treat ‘within’ as referring to the distance between the point estimate and the upper 
limit of the confidence interval). | [1 mark] 


wererndaurereg sete eeinersses eivetaccesostnedsv 4a PEIN GRO RERUN TeSRERSENGERAES HOSSEIN ESRI HSE YO SOSIRRSS ES IBIS ESESRO ARGH eA CBSA ARTF NTGCE RFPS MI TA EMD ASEM OE NEPA Fs SALE ES EH AEN EP TET EEE SESE REND NOTRE EPI SS TNS SIO NEDA SNES EE WE SEITREENER IER ERAT 7 GEI EN PETS LE RIESE ESL AEA ETERS EUSA ET ETO SERCH NESTE GEOINT MRT SIOSAI SEIT PIERS EEO ETM 
DESH ALANSIEPPSAL ERAN EER PEIRUP HOPI NSEEIENAEN reo SPeRRONESIE SEO ses 2 z ers = se 


Ca FIR WIR RHEE NEO SEEDPEER EPROPS DIE SSH BS SLEVIN DEE PSN ATID EPEC 2 POKES TAPER EERESEREA 


anlar smirk bre rte ete OBSR SA or eA MARNIE eta AMR NIRONNA EE EE TEI EAE NEA IFES LESSEE 460K NIOAERNIC SHEP EERIE A IRRREVENESON AUDA) 


_rereamarecrseaseraen BURR ESOIR OYA NE enters NA aLens TAROT NE ARNE 
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